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© Data processing apparatus. 

© A data processing apparatus, having a visible 
register map for associating physical registers with 
logical registers. Instructions involving register-to- 
register transfers are executed by altering the asso- 
ciation between the physical registers and the logical 
registers, without actually transferring data between 
the registers, so as to avoid logically redundant 
operations and to take such instructions out of the 
critical path of execution. 
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Background to the invention 

This invention relates to data processing ap- 
paratus. 

It has been found that, in a typical data pro- 
cessing system, a large proportion of instructions 
merely move data between different storage and 
register locations and only a minority actually cre- 
ate new values. These data movements are logi- 
cally redundant, and the object of the present in- 
vention is to provide a way of reducing the number 
of such data movements, so as to increase the 
performance and efficiency of the system. 

Summary of the invention 

According to the present invention there is 
provided a data processing apparatus comprising a 
plurality of physical registers, means for associat- 
ing the physical registers with a plurality of logical 
registers, and means for executing instructions in- 
volving register-to-register transfers by altering the 
association between the physical registers and the 
logical registers, without actually transferring data 
between the registers. 

A further feature of the present invention is that 
since data movement instructions are executed 
solely by altering association information, such 
instructions can be executed before the contents of 
the source register have been computed. This 
eliminates the execution of such instructions from 
the critical path of execution. 

Brief description of the drawings 

Figure 1 is a schematic block diagram of a 
data processing apparatus in accordance with the 
invention. 

Figure 2 shows the processing unit of the ap- 
paratus in more detail. 

Figure 3 shows an alternative form of the pro- 
cessing unit. 

Description of the embodiments of the invention 

Two embodiments of the invention will now be 
described by way of example with reference to the 
accompanying drawings. 

Referring to Figure 1, this shows a first data 
processing apparatus in accordance with the inven- 
tion. 

The apparatus comprises a processing unit 10, 
a memory 12, and a register file RF containing a 
set of physical hardware registers 14. The memory 
12 is organised in a conventional manner as a 
combination of a main memory 15 and a cache 
memory 16. 

The apparatus also includes an instruction 



fetch unit 18, which fetches sequences of instruc- 
tions from the memory 12. These instructions can 
be grouped into four classes as follows: 
ARITHMETIC: perform an arithmetic or logical op- 
5 eration on operands in specified source registers, 
and write the results into a specified destination 
register. 

MOVE: logically move the contents of a specified 
register into another specified register. 

w LOAD: compute a memory address, by adding the 
value in one or more specified registers (and possi- 
bly a literal displacement value), read the memory 
at that address, and write the resulting value to a 
specified register. 

75 STORE: compute a memory address by adding the 
value in or on more specified registers (and possi- 
bly a literal displacement value), read a specified 
register, and write the resulting value to the mem- 
ory at the calculated address. 

20 Referring now to Figure 2, this shows the pro- 

cessing unit 10 in more detail. The instructions 
fetched by the instruction fetch unit are passed to a 
register allocation unit 20. This unit has access to a 
free register list 21 which holds a list of free 

25 (unallocated) physical registers, and to a visible 
register map (VRM) 22, which holds a table indicat- 
ing associations between the physical (hardware) 
registers and the logical register identifiers used by 
the instructions. 

30 The register allocation unit 20 can handle sev- 
eral instructions concurrently, and analyses these 
instructions to determine their register usage and 
dependencies. The unit allocates physical regis- 
ters, if necessary, from the free register list 21, and 

35 modifies each instruction, using the VRM, so as to 
replace logical register identifiers in the instructions 
with physical register numbers. 

The register allocation unit 20 also checks 
whether all the source registers (i.e., the physical 

40 registers containing the required operand or ad- 
dress values for the instruction) are valid, and that 
a physical register can be allocated to hold the 
result of the instruction. 

if a logical register identifier is mapped onto a 

45 physical register whose contents are not yet valid, 
then any instruction that requires to use the value 
in that logical register as a source operand cannot 
be scheduled until a prior instruction has been 
completed to write the required value to the phys- 

50 ical register. As results are computed and written to 
the physical registers, any instructions awaiting 
these values become eligible for execution. 

When an ARITHMETIC instruction becomes 
eligible for execution, it is passed to an execution 

55 pipeline comprising a register read stage 24, an 
execution stage 26, and a register write stage 28. 

In the case of a MOVE instruction, the register 
allocation unit can immediately update the VRM so 
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that the destination register identifier is now 
mapped onto the same physical register as the 
source register identifier. No further processing of 
the instruction is required. 

Note that it is not necessary for the value in the 
source register to have been previously computed 
for such a MOVE instruction to be executed: only 
register identifiers are manipulated. 

If a value is loaded into a logical register, and 
subsequently another logical register is associated 
with the same value (as the result of a MOVE 
instruction), both logical registers are now asso- 
ciated with the same physical register. If either of 
the original logical registers is now modified, a new 
physical register must be allocated for it, and the 
association information in the VRM updated ac- 
cordingly. 

STORE and LOAD instructions are processed 
by means of an address calculation pipeline and a 
memory access pipeline. The address calculation 
pipeline consists of a base register read stage 34, 
an address generation stage 36, an alias check 
stage 38, and an alias generation stage 40. The 
memory access pipeline comprises an address 
translation stage 42, a cache/register read stage 
44, and a cache/register write stage 46. The alias 
check and alias generation stages 38, 40 have 
access to a store alias map (SAM) 48 which is a 
table indicating associations between memory ad- 
dresses and physical registers. 

In the case of a STORE instruction, the alias 
generation stage 40 makes an entry in the SAM 48 
to indicate that the contents of the store at the 
calculated address can now be found in the phys- 
ical register to which the source register specified 
in the instruction is mapped. This allows subse- 
quent instructions to access the data without the 
need for a memory access. If the calculated ad- 
dress corresponds to an existing entry in the SAM, 
the old association is removed, and the corre- 
sponding physical register then becomes free, and 
is added to the free register list 21 . 

As in the case of MOVE instructions, it is not 
necessary for the value in the source register to 
have been previously computed before creation or 
update of a SAM entry. However, the value will be 
required before an update of memory or cache 
store can take place. 

In the case of a LOAD instruction, the alias 
check stage 38 determines whether there is an 
existing entry for the addressed location in the 
SAM. If not, an entry is generated in the SAM, 
associating the calculated address with the phys- 
ical register identified as the destination of the load 
instruction. When the store data returns from the 
read stage 44, it is stored in this physical register, 
and any dependencies associated with this register 
are released. 



If, on the other hand, there is an existing entry 
for the addressed location in the SAM, this means 
that the data from the required store location is 
already held in a physical register. In this case, the 
5 VRM is updated by the alias check stage 38, to 
map the destination register identifier to this phys- 
ical register. As a result, no memory access is 
necessary. 

It can therefore be seen that the allocation of a 
w destination physical register to a LOAD instruction 
by the register allocation unit 20 is only provisional, 
and may be modified subsequently by the address 
generation pipeline after it has performed this 
check. Any following instructions that use this des- 
75 tination register as a source must therefore wait 
until this provisional allocation is confirmed (where 
there is no existing entry in the SAM) or rescinded 
(where there is an existing entry in the SAM). 

In the case where the provisional allocation is 
20 rescinded, the following actions are performed by 
the address generation pipeline: 

(a) The VRM is updated so that the destination 
register of the LOAD instruction is mapped to 
the physical register containing contents of the 

25 addressed store location. 

(b) All instructions containing a reference to the 
physical register to which the destination regis- 
ter of the LOAD instruction was originally 
mapped are modified to include the correct 

30 physical register number. 

(c) The original physical register is de-allocated, 
and added to the free register list 21 . 

The processing unit also includes a termination 
stage 50 which detects the successful termination 

35 of instructions, and records the most recent guar- 
anteed correct state of the processing apparatus. 
The termination stage also identifies any physical 
registers that are eligible to be re-used (where all 
instructions that could have made reference to a 

40 register have now been successfully completed) 
and adds those registers to the list 21 of free 
registers. This list 21 can be accessed by the 
register allocation unit when it requires to allocate a 
new physical register. 

45 A second data processing apparatus in accor- 
dance with the invention will now be described. 

The overall organisation for this apparatus is 
the same as shown in Figure 1 . Figure 3 shows the 
processing unit in more detail. 

so The processing unit contains the following ta- 

bles: 

Visible register map (VRM) 102: this is a table 
indicating associations between the physical 
(hardware) registers and the logical register identifi- 
es ers used by the instructions. 

Register alias table (RAT) 104: this contains a table 
indicating associations between the physical regis- 
ters. The RAT effectively modifies the mapping 
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specified by the VRM, by indicating that the logical 
register associated with a particular physical regis- 
ter is now associated with a different physical reg- 
ister. 

Store alias map (SAM) 106; this contains a table 
indicating associations between virtual memory ad- 
dresses contained in the instructions and the phys- 
ical registers. 

Instructions received from the instruction fetch 
unit are held in a buffer 110. The buffer contains 
the following fields: 

F : function code 

Pd : logical destination register 

Ps/Pm : logical source or modifier register 
Pt/Pb : logical source or base register 
N : literal value 

The instructions in the buffer 110 are trans- 
formed by a register allocation stage 112, and the 
transformed instruction is placed in a buffer 114. 
The transformed instruction includes the following 
fields: 

F : function code 
Hd : hardware destination register 
Hs/Ht: hardware source registers 
Hm : hardware modifier register 
Hb : hardware base register 
N : literal value 

A further buffer 116 receives the values F, Hd, 
Hs, Ht and N from the buffer 114, and also a virtual 
address VA which is generated from Hb, Hm and N 
by an address generation stage 118. 

Finally, the contents of the buffer 116 are pro- 
cessed by an execution and data fetch stage 120. 
The register allocation stage 112 comprises the 
following units: 

Register assignment unit 122. This assigns a 
hardware register to Pd, and updates the VRM 
table. The register assignment unit maintains a list 
of free hardware registers (not shown) which are 
available for assigning. 

First register mapping unit 124. This maps 
Ps/Pm onto a hardware register (Hs or Hm) using 
the VRM and RAT tables. 

Second register mapping unit 126. This maps 
Pt/Pb onto a hardware register (Ht or Hb) using the 
VRM and RAT tables. 

The address generation stage 118 comprises 
the following units: 

Register file access unit 128. This uses the 
field Hb or Hm to access the register file RF to 
obtain a base or modifier value. 

Address generation unit 130. This uses the 
base or modifier and the literal value N to calculate 
the virtual address VA. 

VAT read unit 132. This uses the VA to read 
the SAM to determine whether a hardware register 
H is already associated with this particular VA. If 
so, this is referred to as a SAM hit; otherwise, a 



SAM miss has occurred. 

RAT write unit 134. This uses the value H, in 

the event of a SAM hit, to form an association in 

the RAT table between Hd and H. 
5 SAM write unit 136. This uses the VA and the 

field Hd or Hs to form a new entry in the SAM, 

associating this virtual address with the hardware 

register Hd or Hs. 

Register file access unit 135. This uses the 
w field Hs to access the register file RF to obtain a 

data item. 

Write buffer 137. This receives the virtual ad- 
dress VA, and receives the data item from the RF 
and writes it to the memory at address VA. 
rs The execution stage 120 comprises the follow- 
ing units: 

Register file access unit 138. This receives the 
field Hs or Ht and accesses the register file RF to 
obtain a source operand or operands. 
20 Execution unit 140. This receives the source 

operand(s) and/or the literal N, and performs an 
arithmetic or logical function as specified by the 
function code F. 

Register file access unit 142. This receives the 
25 result of the function, and writes it to the hardware 
register specified by Hd. 

Data fetch unit 144. This receives the virtual 
address VA and accesses the memory at this ad- 
dress, to read a data item. The data item is passed 
30 to the register file access unit 142 for writing to the 
hardware destination register Hd. 

The operation of the apparatus in response to 
the different instruction types will now be de- 
scribed. 

35 

L Pd/(Pb + N) 

This instruction indicates that the virtual ad- 
dress is to be calculated by adding the contents of 

40 a logical base register Pb to a literal value N. The 
memory is then to be accessed, using this virtual 
address, and the data is to be loaded into logical 
destination register Pd. 

In this case, the unit 122 assigns a free hard- 

45 ware register Hd as the destination register, and 
updates the VRM so as to associate this with the 
logical register Pd. At the same time, the unit 126 
consults the RAT and VRM, to find the hardware 
register Hb which corresponds to logical register 

50 Pb. 

The value Hb is used by the unit 128 to access 
the RF so as to obtain the base value. This base 
value is then added by the unit 130 to the literal 
value N, so as to generate the virtual address VA. 
55 The virtual VA is used by the unit 132 to 
access the SAM map to determine whether VA is 
already associated with any hardware register. In 
the event of a SAM miss, the VA is passed to the 
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data fetch unit 144, so as to initiate fetching of the 
required data from the memory. This data is then 
written, by the unit 142, into the destination register 
Hd in the register file. At the same time, VA and 
Hd are sent to the unit 136, which creates a new 
entry in the SAM, associating VA with Hd. 

In the event of a SAM hit, the unit 132 returns 
a value H which identifies a hardware register hold- 
ing the required data value. Hence, a memory 
access is unnecessary and the instruction does not 
pass to the execution stage 120. The values H and 
Hd are used by the unit 134 to form an association 
between the register H and the destination register 
Hd. Hence, it is not necessary in this case actually 
to write the data into the destination register. 

ST/Ps (Pb + N) 

This instruction specifies that a virtual address 
is to be formed by adding the contents of logical 
base register Pb to literal value N. The contents of 
logical source register Ps are then written to the 
memory at this address. 

The values Ps and Pb are used by the map- 
ping units 124 and 126 to access the RAT and 
VRM to find the hardware registers Hs and Hb 
which hold the source operand and base value. 

Hb is used by the unit 128 to access the RF to 
obtain the base value, and this is added to the 
literal N by unit 130, to form the virtual address VA. 

Hs is used by the unit 135 to access the RF to 
obtain the source operand, and this is then written 
to the memory location specified by VA, by way of 
the write buffer. 

At the same time, the VA is used by the unit 
132 to access the SAM table to determine whether 
this virtual address is already associated with one 
of the hardware registers. 

If no association exists, the unit 136 creates a 
new entry in the SAM associating the VA with the 
hardware source register Hs. If, on the other hand, 
an association already exists (SAM hit), the existing 
entry is overwritten with the new entry. 

FUN Pd/Ps, Pt 

This instruction indicates that an arithmetic or 
logical operation, specified by the function code 
FUN, is to be performed on source operands in 
logical registers Ps and Pt, and the result written 
into logical register Pd. 

The unit 122 assigns a free hardware register 
Hd as the destination register, and updates the 
VRM table to associate this with the logical register 
Pd. At the same time, the units 124 and 126 
access the RAT and VRM tables find the hardware 
registers Hs, Ht associated with Ps, Pt. 

The unit 138 then uses the values Hs, Ht to 



access the RF to obtain the operands, and the 
specified function is performed by the unit 140. 
The result is written into hardware register Hd by 
the unit 142. 

5 

M Pd/Ps 

This instruction specifies that the contents of 
logical register Ps are to be moved to logical 
10 register Pd. 

In this case, the unit 122 assigns a hardware 
register Hd as the destination register, and updates 
the VRM to associate this with the logical register 
Pd. 

15 At the same time, the unit 124 accesses the 
RAT and VRM to find which hardware register Hs 
is associated with Ps. 

Finally the unit 134 uses these values Hs, Hd 
to create a new entry in the RAT, associating the 

20 registers Hs and Hd. 

Thus, it can be seen that in this case, no actual 
movement of data between the register is required 
- only the associations between the registers are 
changed. 

25 In summary, it can be seen that in this appara- 
tus, the number of logically redundant operations 
(which do not create new values) is minimised, with 
. a corresponding reduction in the critical paths of 
programs. This is achieved by the use of tables in 

30 such a manner that, wherever possible, data move- 
ments are implemented by changing the associ- 
ations between hardware registers on the one hand 
and logical registers and virtual addresses on the 
other. 

35 

Claims 

1. A data processing apparatus, comprising a plu- 
rality of physical registers (14), means (20, 22; 

40 102, 104, 122, 124, 126) for variably associat- 

ing the physical registers with a plurality of 
logical registers, and means (38; 134) for ex- 
ecuting instructions involving register-to-regis- 
ter transfers by altering the association be- 

45 tween the physical registers and the logical 

registers, without actually transferring data be- 
tween the registers. 

2. Apparatus according to Claim 1 including a 
so memory table (22; 102) for storing a map of 

the correspondences between said physical 
registers and the logical registers. 

3. Apparatus according to either preceding claim 
55 including a further table (48, 106) for storing a 

map of associations between the physical reg- 
isters and memory addresses. 
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Data processing apparatus comprising 

(a) a set of physical registers (14), 

(b) a main memory, 

(c) an instruction fetch unit (18) for fetching 

a sequence of instructions specifying oper- 5 
ations to be performed on the contents of a 
set of logical registers, the Instructions com- 
prising MOVE instructions for moving the 
contents of a specified logical source regis- 
ter to a specified logical destination register, w 
LOAD instructions for loading an operand 
from a specified memory location into a 
specified logical destination register, SAVE 
instructions for saving the contents of a 
specified logical source register to a speci- 75 
fied memory location and ARITHMETIC 
Instructions for performing logical and 
arithmetic operations on the contents of at 
least one specified logical source register, 

(d) register allocation means (20, 22; 102, 20 
104, 122, 124, 126) for forming associations 
between the logical registers and the phys- 
ical registers and 

(e) means (38; 34) responsive to a MOVE 
instruction for varying said associations so 25 
as to associate the logical destination regis- 
ter of the MOVE instruction with a physical 
register associated with the logical source 
register of the MOVE instruction. 

30 

Apparatus according to Claim 4, including a 
register map containing a plurality of entries, 
each entry indicating an association between a 
logical register and a physical register, and 
wherein whenever an instruction specifying a 35 
logical destination register is executed, the 
register allocation means creates an entry in 
the register map indicating an association be- 
tween the logical destination register of the 
instruction and a selected physical register. 40 

Apparatus according to Claim 4 or 5 including 
a register alias table (104) containing a plural- 
ity of entries each entry indicating an associ- 
ation between a pair of physical registers, and 45 
wherein said means for varying said associ- 
ations comprises means for creating an entry 
in said register alias table indicating an associ- 
ation between a physical register associated 
with the logical destination register of said 50 
MOVE instruction and the physical register as- 
sociated with the logical source register of said 
MOVE instruction. 

Apparatus according to any one of Claims 4 to 55 
6 including a store alias map (106) containing 
a plurality of entries, each entry indicating an 
association between a physical register and 



the contents of a specified memory location, 
and including means operative in response to 
a LOAD instruction for consulting the store 
alias map to determine whether the memory 
location specified by the LOAD instruction is 
associated with a physical register and, if so, 
terminating the LOAD instruction without ac- 
cessing the memory. 

8. Apparatus according to Claim 7 when depen- 
dent upon Claim 6, including means operative 
in the event that the memory location specified 
by the LOAD instruction is associated with a 
physical register, for creating an entry in the 
register alias table indicating an association 
between that physical register and the physical 
register associated with the logical destination 
register of the LOAD instruction. 

9. Apparatus according to any of Claims 4 to 8 
wherein the register allocation means modifies 
each instruction by replacing references in the 
instruction to logical registers by reference to 
associated physical registers. 

10. Pipelined data processing apparatus compris- 
ing: 

(a) a set of physical registers, 

(b) a main memory, and 

(c) an instruction pipeline comprising an in- 
struction fetch stage, a register allocation 
stage, an address generation stage, and an 
execution and data fetch stage connected 
together in series, 

(d) wherein the instruction fetch stage com- 
prises means for fetching a sequence of 
instructions specifying operations to be per- 
formed on the contents of a set of logical 
registers, the instructions comprising MOVE 
instructions for moving the contents of a 
specified logical source register to a speci- 
fied logical destination register, LOAD 
instructions for loading an operand from a 
specified memory location into a specified 
logical destination register, SAVE instruc- 
tions for saving the contents of a specified 
logical source register to a specified mem- 
ory location and ARITHMETIC instructions 
for performing logical and arithmetic oper- 
ations on the contents of at least one speci- 
fied logical source register, 

(e) wherein the register allocation stage 
comprises means for forming associations 
between the logical registers and the phys- 
ical registers, 

(f) and wherein the address generation 
stage comprises means responsive to a 
MOVE instruction for varying said associ- 
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ations so as to associate the logical destina- 
tion register of the MOVE instruction with a 
physical register associated with the logical 
source register of the MOVE instruction. 

5 

11. Apparatus according to Claim 10 wherein the 
address generation stage further includes 
means for forming associations between the 
memory locations and the physical registers, 
and means operative in response to a LOAD w 
instruction for determining whether the mem- 
ory location specified by the LOAD instruction 
is associated with a physical register and, if so, 
for varying the associations between the phys- 
ical registers and the logical registers so as to 75 
form an association between that physical reg- 
ister and the logical destination register of the 
LOAD instruction. 

20 
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